Protein folding trajectory analysis using patterned clusters
نویسندگان
چکیده
Understanding how protein folds into a functional and structural configuration is arguably one of the most important and challenging problems in computational biology. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus the reaction coordinates such as the fraction of native contacts, the radius of gyration, the principal components and so on. In this paper, we present a combinatorial algorithmic approach towards understanding the global state changes of the configurations. The approach is based on cluster computation, each cluster being defined by a pattern of a combination of various reaction coordinates. We present an algorithm of time complexity O((N + nm) log n) where N is the size of the output and n×m is the size of the input. To date, this is the best time complexity for the problem. We next demonstrate that this approach extracts crucial information about protein folding intermediate states and mechanism. (1) The method recovers states previously obtained by visually analyzing free energy contour maps. (2) It also succeeds in extracting meaningful patterns and structures that had been overlooked in previous works, which provide a better understanding of the folding mechanism (of a β-hairpin protein). These new patterns also interconnect various states in existing free energy contour maps versus different reaction coordinates. (3) The approach does not require the free energy values, yet it offers analysis comparable and sometimes better than the methods that use free energy landscapes, thus validating the choice of reaction coordinates.
منابع مشابه
Combinatorial Pattern Discovery Approach for the Folding Trajectory Analysis of a β-Hairpin
The study of protein folding mechanisms continues to be one of the most challenging problems in computational biology. Currently, the protein folding mechanism is often characterized by calculating the free energy landscape versus various reaction coordinates, such as the fraction of native contacts, the radius of gyration, RMSD from the native structure, and so on. In this paper, we present a ...
متن کاملItinerary profiling to analyze a large number of protein-folding trajectories
Understanding how proteins fold through a vast number of unfolded states is a major subject in the study of protein folding. Herein, we present itinerary profiling as a simple method to analyze molecular dynamics trajectories, and apply this method to Trp-cage. In itinerary profiling, structural clusters included in a trajectory are represented by a bit sequence, and a number of trajectories, a...
متن کاملSelf Consistent Path Sampling: Making Accurate All-Atom Protein Folding Simulations Possible on Small Computer Clusters
We introduce a powerful iterative algorithm to compute protein folding pathways, with realistic all-atom force fields. Using the path integral formalism, we explicitly derive a modified Langevin equation which samples directly the ensemble of reactive pathways, exponentially reducing the cost of simulating thermally activated transitions. The algorithm also yields a rigorous stochastic estimate...
متن کاملTrajectory Clustering Using Longer Length Units for Automatic Speech Recognition
One of the major deficiencies of conventional hidden Markov modelling (HMM) is known as the trajectory folding phenomenon. Multipath Models can solve the trajectory folding problem by assuming that a large part of the variation in acoustic data can be attributed to different observation classes and which can then be modelled separately. In this paper, we present an approach to automatically clu...
متن کاملAnalysis of Molecular Dynamics Simulations of Protein Folding by Aruna Rajan
Microsecond long Molecular Dynamics (MD) trajectories of biomolecular processes are now possible due to advances in computer technology. Soon, trajectories long enough to probe dynamics over many milliseconds will become available. Since these timescales match the physiological timescales over which many small proteins fold, all atom MD simulations of protein folding are now becoming popular. T...
متن کامل